Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 529 |
| Missing cells | 1066 |
| Missing cells (%) | 10.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 78.7 KiB |
| Average record size in memory | 152.2 B |
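The missing-cell percentage can be checked from the other figures in this table. The sketch below assumes the denominator is the full grid of variables × observations; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 19
n_observations = 529
missing_cells = 1066

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result rounds to the 10.6% shown in the table.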
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 10 |
nbinsta is highly overall correlated with nbtik and 1 other fields | High correlation |
nbtwit is highly overall correlated with twitter | High correlation |
nbsnap is highly overall correlated with snapchat | High correlation |
nbtik is highly overall correlated with nbinsta and 1 other fields | High correlation |
googp is highly overall correlated with googmp | High correlation |
googmp is highly overall correlated with googp | High correlation |
instagra is highly overall correlated with nbinsta | High correlation |
twitter is highly overall correlated with nbtwit | High correlation |
snapchat is highly overall correlated with nbsnap | High correlation |
tiktok is highly overall correlated with nbtik | High correlation |
instagra is highly imbalanced (79.4%) | Imbalance |
nbinsta has 21 (4.0%) missing values | Missing |
nbtwit has 372 (70.3%) missing values | Missing |
snapchat has 7 (1.3%) missing values | Missing |
nbsnap has 259 (49.0%) missing values | Missing |
nbtik has 332 (62.8%) missing values | Missing |
instap has 8 (1.5%) missing values | Missing |
snapp has 13 (2.5%) missing values | Missing |
googp has 10 (1.9%) missing values | Missing |
googmp has 10 (1.9%) missing values | Missing |
random_id4 has unique values | Unique |
nbtwit has 19 (3.6%) zeros | Zeros |
instap has 222 (42.0%) zeros | Zeros |
snapp has 421 (79.6%) zeros | Zeros |
googp has 143 (27.0%) zeros | Zeros |
googmp has 190 (35.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-06 10:12:42.794866 |
|---|---|
| Analysis finished | 2024-02-06 10:12:47.146318 |
| Duration | 4.35 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
random_id4
Real number (ℝ)
UNIQUE 
| Distinct | 529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5116062.9 |
| Minimum | 37320 |
| Maximum | 9973576 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 37320 |
|---|---|
| 5-th percentile | 647120 |
| Q1 | 2925179 |
| median | 5121654 |
| Q3 | 7503167 |
| 95-th percentile | 9217394.2 |
| Maximum | 9973576 |
| Range | 9936256 |
| Interquartile range (IQR) | 4577988 |
Descriptive statistics
| Standard deviation | 2788763.2 |
|---|---|
| Coefficient of variation (CV) | 0.54509948 |
| Kurtosis | -1.1307902 |
| Mean | 5116062.9 |
| Median Absolute Deviation (MAD) | 2330145 |
| Skewness | -0.073446117 |
| Sum | 2.7063972 × 10⁹ |
| Variance | 7.7772001 × 10¹² |
| Monotonicity | Strictly increasing |
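Several rows in the quantile and descriptive tables for random_id4 are derived from others. As a rough cross-check, the sketch below recomputes them from the rounded values printed in the report, so small rounding differences are expected. The variable names are illustrative.

```python
# Values copied from the random_id4 tables (already rounded by the report).
minimum, maximum = 37320, 9973576
q1, q3 = 2925179, 7503167
mean, std = 5116062.9, 2788763.2
n_observations = 529

value_range = maximum - minimum   # reported Range
iqr = q3 - q1                     # reported Interquartile range (IQR)
cv = std / mean                   # reported Coefficient of variation (CV)
variance = std ** 2               # reported Variance
total = mean * n_observations     # reported Sum (no missing values here)

print(value_range, iqr)
print(f"CV={cv:.6f}  variance={variance:.4e}  sum={total:.4e}")
```

The recomputed values agree with the reported Range (9936256), IQR (4577988), CV (about 0.5451), Variance (about 7.7772 × 10¹²) and Sum (about 2.7064 × 10⁹).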
| Value | Count | Frequency (%) |
| 37320 | 1 | 0.2% |
| 6583444 | 1 | 0.2% |
| 6967343 | 1 | 0.2% |
| 6953254 | 1 | 0.2% |
| 6865922 | 1 | 0.2% |
| 6837858 | 1 | 0.2% |
| 6820649 | 1 | 0.2% |
| 6818054 | 1 | 0.2% |
| 6757890 | 1 | 0.2% |
| 6752959 | 1 | 0.2% |
| Other values (519) | 519 |
| Value | Count | Frequency (%) |
| 37320 | 1 | |
| 75227 | 1 | |
| 78695 | 1 | |
| 80831 | 1 | |
| 140145 | 1 | |
| 142591 | 1 | |
| 154015 | 1 | |
| 157492 | 1 | |
| 180228 | 1 | |
| 215268 | 1 |
| Value | Count | Frequency (%) |
| 9973576 | 1 | |
| 9956964 | 1 | |
| 9947814 | 1 | |
| 9940978 | 1 | |
| 9928746 | 1 | |
| 9897608 | 1 | |
| 9816676 | 1 | |
| 9801565 | 1 | |
| 9745640 | 1 | |
| 9741187 | 1 |
survey
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 KiB |
| ScPoBx_1A | 299 |
|---|---|
| ScPoBx_3A | 230 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 4761 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ScPoBx_3A |
|---|---|
| 2nd row | ScPoBx_3A |
| 3rd row | ScPoBx_3A |
| 4th row | ScPoBx_1A |
| 5th row | ScPoBx_1A |
Common Values
| Value | Count | Frequency (%) |
| ScPoBx_1A | 299 | |
| ScPoBx_3A | 230 |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| scpobx_1a | 299 | |
| scpobx_3a | 230 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 529 | |
| c | 529 | |
| P | 529 | |
| o | 529 | |
| B | 529 | |
| x | 529 | |
| _ | 529 | |
| A | 529 | |
| 1 | 299 | |
| 3 | 230 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2116 | |
| Lowercase Letter | 1587 | |
| Connector Punctuation | 529 | 11.1% |
| Decimal Number | 529 | 11.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 529 | |
| P | 529 | |
| B | 529 | |
| A | 529 |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 529 | |
| o | 529 | |
| x | 529 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 299 | |
| 3 | 230 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3703 | |
| Common | 1058 | 22.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 529 | |
| c | 529 | |
| P | 529 | |
| o | 529 | |
| B | 529 | |
| x | 529 | |
| A | 529 |
Common
| Value | Count | Frequency (%) |
| _ | 529 | |
| 1 | 299 | |
| 3 | 230 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4761 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 529 | |
| c | 529 | |
| P | 529 | |
| o | 529 | |
| B | 529 | |
| x | 529 | |
| _ | 529 | |
| A | 529 | |
| 1 | 299 | |
| 3 | 230 |
tps_rs
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 4 |
| Missing (%) | 0.8% |
| Memory size | 4.3 KiB |
| 3.0 | 206 |
|---|---|
| 2.0 | 172 |
| 1.0 | 76 |
| 4.0 | 59 |
| 5.0 | 12 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1575 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 4.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 206 | |
| 2.0 | 172 | |
| 1.0 | 76 | 14.4% |
| 4.0 | 59 | 11.2% |
| 5.0 | 12 | 2.3% |
| (Missing) | 4 | 0.8% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| 3.0 | 206 | |
| 2.0 | 172 | |
| 1.0 | 76 | 14.5% |
| 4.0 | 59 | 11.2% |
| 5.0 | 12 | 2.3% |
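The two frequency tables for tps_rs disagree slightly for the value 1.0 (14.4% versus 14.5%). One plausible explanation, assumed in the sketch below, is that the first table divides by all 529 observations while the second divides by the 525 non-missing ones. The names `counts`, `pct_all` and `pct_non_missing` are illustrative.

```python
# Counts copied from the tps_rs "Common Values" table.
counts = {"3.0": 206, "2.0": 172, "1.0": 76, "4.0": 59, "5.0": 12}
missing = 4

non_missing = sum(counts.values())     # 525
n_observations = non_missing + missing  # 529

# Assumption: one table uses all rows, the other only non-missing rows.
pct_all = {v: 100 * c / n_observations for v, c in counts.items()}
pct_non_missing = {v: 100 * c / non_missing for v, c in counts.items()}

for value in counts:
    print(f"{value}: {pct_all[value]:.1f}% of all rows, "
          f"{pct_non_missing[value]:.1f}% of non-missing rows")
```

With these denominators the value 1.0 comes out at 14.4% and 14.5% respectively, while 4.0 and 5.0 round to the same figure in both tables, which matches what the report shows.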
Most occurring characters
| Value | Count | Frequency (%) |
| . | 525 | |
| 0 | 525 | |
| 3 | 206 | 13.1% |
| 2 | 172 | 10.9% |
| 1 | 76 | 4.8% |
| 4 | 59 | 3.7% |
| 5 | 12 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1050 | |
| Other Punctuation | 525 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 525 | |
| 3 | 206 | 19.6% |
| 2 | 172 | 16.4% |
| 1 | 76 | 7.2% |
| 4 | 59 | 5.6% |
| 5 | 12 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 525 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1575 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 525 | |
| 0 | 525 | |
| 3 | 206 | 13.1% |
| 2 | 172 | 10.9% |
| 1 | 76 | 4.8% |
| 4 | 59 | 3.7% |
| 5 | 12 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 525 | |
| 0 | 525 | |
| 3 | 206 | 13.1% |
| 2 | 172 | 10.9% |
| 1 | 76 | 4.8% |
| 4 | 59 | 3.7% |
| 5 | 12 | 0.8% |
instagra
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 3 |
| Missing (%) | 0.6% |
| Memory size | 4.3 KiB |
| J'ai | 509 |
|---|---|
| Je n'ai pas | 17 |
Length
| Max length | 11 |
|---|---|
| Median length | 4 |
| Mean length | 4.2262357 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2223 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | J'ai |
|---|---|
| 2nd row | J'ai |
| 3rd row | J'ai |
| 4th row | J'ai |
| 5th row | J'ai |
Common Values
| Value | Count | Frequency (%) |
| J'ai | 509 | |
| Je n'ai pas | 17 | 3.2% |
| (Missing) | 3 | 0.6% |
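The 79.4% imbalance alert for instagra can be reproduced from the two category counts above. The sketch assumes the imbalance score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes. That definition is an assumption made here, not something the report states.

```python
import math

# Non-missing category counts from the instagra "Common Values" table.
counts = [509, 17]

total = sum(counts)
probs = [c / total for c in counts]

# Shannon entropy in bits, then normalized by log2(number of classes).
entropy = -sum(p * math.log2(p) for p in probs)
imbalance = 1 - entropy / math.log2(len(counts))

print(f"imbalance = {100 * imbalance:.1f}%")
```

A perfectly balanced two-class variable would score 0% under this definition, and a variable with a single dominant class approaches 100%.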
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| j'ai | 509 | |
| je | 17 | 3.0% |
| n'ai | 17 | 3.0% |
| pas | 17 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 543 | |
| J | 526 | |
| ' | 526 | |
| i | 526 | |
| (space) | 34 | 1.5% |
| e | 17 | 0.8% |
| n | 17 | 0.8% |
| p | 17 | 0.8% |
| s | 17 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1137 | |
| Uppercase Letter | 526 | |
| Other Punctuation | 526 | |
| Space Separator | 34 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 543 | |
| i | 526 | |
| e | 17 | 1.5% |
| n | 17 | 1.5% |
| p | 17 | 1.5% |
| s | 17 | 1.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 526 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 526 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 34 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1663 | |
| Common | 560 | 25.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 543 | |
| J | 526 | |
| i | 526 | |
| e | 17 | 1.0% |
| n | 17 | 1.0% |
| p | 17 | 1.0% |
| s | 17 | 1.0% |
Common
| Value | Count | Frequency (%) |
| ' | 526 | |
| (space) | 34 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2223 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 543 | |
| J | 526 | |
| ' | 526 | |
| i | 526 | |
| (space) | 34 | 1.5% |
| e | 17 | 0.8% |
| n | 17 | 0.8% |
| p | 17 | 0.8% |
| s | 17 | 0.8% |
nbinsta
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 280 |
|---|---|
| Distinct (%) | 55.1% |
| Missing | 21 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 438.31299 |
| Minimum | 18 |
| Maximum | 3758 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 91.35 |
| Q1 | 200 |
| median | 354.5 |
| Q3 | 580 |
| 95-th percentile | 1034.75 |
| Maximum | 3758 |
| Range | 3740 |
| Interquartile range (IQR) | 380 |
Descriptive statistics
| Standard deviation | 352.96269 |
|---|---|
| Coefficient of variation (CV) | 0.80527543 |
| Kurtosis | 20.310765 |
| Mean | 438.31299 |
| Median Absolute Deviation (MAD) | 161 |
| Skewness | 3.165841 |
| Sum | 222663 |
| Variance | 124582.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 400 | 19 | 3.6% |
| 500 | 18 | 3.4% |
| 300 | 16 | 3.0% |
| 200 | 16 | 3.0% |
| 150 | 12 | 2.3% |
| 600 | 11 | 2.1% |
| 450 | 10 | 1.9% |
| 250 | 10 | 1.9% |
| 350 | 8 | 1.5% |
| 900 | 7 | 1.3% |
| Other values (270) | 381 | |
| (Missing) | 21 | 4.0% |
| Value | Count | Frequency (%) |
| 18 | 1 | 0.2% |
| 30 | 1 | 0.2% |
| 37 | 1 | 0.2% |
| 38 | 1 | 0.2% |
| 39 | 1 | 0.2% |
| 50 | 1 | 0.2% |
| 53 | 1 | 0.2% |
| 54 | 1 | 0.2% |
| 60 | 4 | |
| 70 | 2 |
| Value | Count | Frequency (%) |
| 3758 | 1 | 0.2% |
| 2888 | 1 | 0.2% |
| 2000 | 1 | 0.2% |
| 1750 | 1 | 0.2% |
| 1600 | 2 | |
| 1524 | 1 | 0.2% |
| 1400 | 2 | |
| 1300 | 3 | |
| 1235 | 1 | 0.2% |
| 1200 | 3 |
twitter
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 5 |
| Missing (%) | 0.9% |
| Memory size | 4.3 KiB |
| Je n'ai pas | 365 |
|---|---|
| J'ai | 159 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 8.8759542 |
| Min length | 4 |
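The length statistics for twitter follow directly from the two category counts reported for this variable (365 and 159 non-missing answers). A small sketch of that arithmetic is below. It assumes lengths are counted in characters over non-missing values only.

```python
from statistics import median

# Category counts reported for twitter (missing values excluded).
counts = {"Je n'ai pas": 365, "J'ai": 159}

# One length entry per non-missing answer.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

total_characters = sum(lengths)
mean_length = total_characters / len(lengths)
median_length = median(lengths)

print(min(lengths), max(lengths), median_length)
print(total_characters, round(mean_length, 7))
```

This gives a minimum of 4, a maximum and median of 11, a mean of about 8.876 and 4651 characters in total, in line with the Length and Characters tables for this variable.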
Characters and Unicode
| Total characters | 4651 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Je n'ai pas |
|---|---|
| 2nd row | Je n'ai pas |
| 3rd row | J'ai |
| 4th row | Je n'ai pas |
| 5th row | Je n'ai pas |
Common Values
| Value | Count | Frequency (%) |
| Je n'ai pas | 365 | |
| J'ai | 159 | |
| (Missing) | 5 | 0.9% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| je | 365 | |
| n'ai | 365 | |
| pas | 365 | |
| j'ai | 159 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 889 | |
| (space) | 730 | |
| J | 524 | |
| ' | 524 | |
| i | 524 | |
| e | 365 | |
| n | 365 | |
| p | 365 | |
| s | 365 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2873 | |
| Space Separator | 730 | 15.7% |
| Uppercase Letter | 524 | 11.3% |
| Other Punctuation | 524 | 11.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 889 | |
| i | 524 | |
| e | 365 | |
| n | 365 | |
| p | 365 | |
| s | 365 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 730 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 524 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 524 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3397 | |
| Common | 1254 | 27.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 889 | |
| J | 524 | |
| i | 524 | |
| e | 365 | |
| n | 365 | |
| p | 365 | |
| s | 365 |
Common
| Value | Count | Frequency (%) |
| (space) | 730 | |
| ' | 524 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 889 | |
| (space) | 730 | |
| J | 524 | |
| ' | 524 | |
| i | 524 | |
| e | 365 | |
| n | 365 | |
| p | 365 | |
| s | 365 |
nbtwit
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 63 |
|---|---|
| Distinct (%) | 40.1% |
| Missing | 372 |
| Missing (%) | 70.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.745223 |
| Minimum | 0 |
| Maximum | 1907 |
| Zeros | 19 |
| Zeros (%) | 3.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
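The percentages in the nbtwit overview do not all share a denominator. Judging from the numbers alone, Missing (%) and Zeros (%) appear to be relative to all 529 observations, while Distinct (%) and the mean appear to use only the non-missing values. The sketch below checks that reading. It is an inference from the reported figures, not a statement about the tool's internals.

```python
# Figures copied from the nbtwit tables.
n_observations = 529
missing = 372
distinct = 63
zeros = 19
reported_sum = 10479

non_missing = n_observations - missing  # 157 usable answers

missing_pct = 100 * missing / n_observations
zeros_pct = 100 * zeros / n_observations
distinct_pct = 100 * distinct / non_missing   # assumed denominator
mean = reported_sum / non_missing             # assumed denominator

print(f"missing {missing_pct:.1f}%, zeros {zeros_pct:.1f}%, "
      f"distinct {distinct_pct:.1f}%, mean {mean:.6f}")
```

All four values match the report (70.3%, 3.6%, 40.1% and 66.745223), which is worth keeping in mind when comparing percentages across columns with very different missing rates.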
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 15 |
| Q3 | 50 |
| 95-th percentile | 214.2 |
| Maximum | 1907 |
| Range | 1907 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 191.64858 |
|---|---|
| Coefficient of variation (CV) | 2.8713453 |
| Kurtosis | 59.520178 |
| Mean | 66.745223 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 7.0304081 |
| Sum | 10479 |
| Variance | 36729.178 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19 | 3.6% |
| 3 | 10 | 1.9% |
| 2 | 10 | 1.9% |
| 10 | 10 | 1.9% |
| 50 | 7 | 1.3% |
| 20 | 7 | 1.3% |
| 30 | 6 | 1.1% |
| 150 | 5 | 0.9% |
| 1 | 4 | 0.8% |
| 15 | 4 | 0.8% |
| Other values (53) | 75 | 14.2% |
| (Missing) | 372 |
| Value | Count | Frequency (%) |
| 0 | 19 | |
| 1 | 4 | 0.8% |
| 2 | 10 | |
| 3 | 10 | |
| 4 | 3 | 0.6% |
| 5 | 4 | 0.8% |
| 6 | 3 | 0.6% |
| 7 | 2 | 0.4% |
| 8 | 1 | 0.2% |
| 9 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 1907 | 1 | |
| 1050 | 1 | |
| 763 | 1 | |
| 400 | 1 | |
| 364 | 1 | |
| 351 | 1 | |
| 300 | 1 | |
| 271 | 1 | |
| 200 | 2 | |
| 193 | 1 |
snapchat
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 7 |
| Missing (%) | 1.3% |
| Memory size | 4.3 KiB |
| J'ai | 277 |
|---|---|
| Je n'ai pas | 245 |
Length
| Max length | 11 |
|---|---|
| Median length | 4 |
| Mean length | 7.2854406 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3803 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Je n'ai pas |
|---|---|
| 2nd row | J'ai |
| 3rd row | Je n'ai pas |
| 4th row | J'ai |
| 5th row | J'ai |
Common Values
| Value | Count | Frequency (%) |
| J'ai | 277 | |
| Je n'ai pas | 245 | |
| (Missing) | 7 | 1.3% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| j'ai | 277 | |
| je | 245 | |
| n'ai | 245 | |
| pas | 245 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 767 | |
| J | 522 | |
| ' | 522 | |
| i | 522 | |
| (space) | 490 | |
| e | 245 | 6.4% |
| n | 245 | 6.4% |
| p | 245 | 6.4% |
| s | 245 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2269 | |
| Uppercase Letter | 522 | 13.7% |
| Other Punctuation | 522 | 13.7% |
| Space Separator | 490 | 12.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 767 | |
| i | 522 | |
| e | 245 | 10.8% |
| n | 245 | 10.8% |
| p | 245 | 10.8% |
| s | 245 | 10.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 522 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 522 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 490 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2791 | |
| Common | 1012 | 26.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 767 | |
| J | 522 | |
| i | 522 | |
| e | 245 | 8.8% |
| n | 245 | 8.8% |
| p | 245 | 8.8% |
| s | 245 | 8.8% |
Common
| Value | Count | Frequency (%) |
| ' | 522 | |
| (space) | 490 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3803 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 767 | |
| J | 522 | |
| ' | 522 | |
| i | 522 | |
| (space) | 490 | |
| e | 245 | 6.4% |
| n | 245 | 6.4% |
| p | 245 | 6.4% |
| s | 245 | 6.4% |
nbsnap
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 62 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 259 |
| Missing (%) | 49.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.52963 |
| Minimum | 0 |
| Maximum | 2000 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 40 |
| median | 100 |
| Q3 | 150 |
| 95-th percentile | 327.5 |
| Maximum | 2000 |
| Range | 2000 |
| Interquartile range (IQR) | 110 |
Descriptive statistics
| Standard deviation | 171.41245 |
|---|---|
| Coefficient of variation (CV) | 1.3547218 |
| Kurtosis | 55.922819 |
| Mean | 126.52963 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | 6.0736892 |
| Sum | 34163 |
| Variance | 29382.228 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 38 | 7.2% |
| 50 | 30 | 5.7% |
| 150 | 27 | 5.1% |
| 200 | 23 | 4.3% |
| 30 | 19 | 3.6% |
| 40 | 15 | 2.8% |
| 20 | 13 | 2.5% |
| 60 | 11 | 2.1% |
| 10 | 9 | 1.7% |
| 300 | 9 | 1.7% |
| Other values (52) | 76 | 14.4% |
| (Missing) | 259 |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 3 | 2 | 0.4% |
| 4 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 8 | 1 | 0.2% |
| 10 | 9 | |
| 12 | 1 | 0.2% |
| 15 | 3 | 0.6% |
| 20 | 13 | |
| 28 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 2000 | 1 | 0.2% |
| 1000 | 1 | 0.2% |
| 700 | 2 | |
| 615 | 1 | 0.2% |
| 600 | 3 | |
| 500 | 1 | 0.2% |
| 450 | 1 | 0.2% |
| 400 | 1 | 0.2% |
| 354 | 1 | 0.2% |
| 350 | 2 |
tiktok
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 4 |
| Missing (%) | 0.8% |
| Memory size | 4.3 KiB |
| Je n'ai pas | 326 |
|---|---|
| J'ai | 199 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 8.3466667 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4382 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Je n'ai pas |
|---|---|
| 2nd row | J'ai |
| 3rd row | J'ai |
| 4th row | J'ai |
| 5th row | J'ai |
Common Values
| Value | Count | Frequency (%) |
| Je n'ai pas | 326 | |
| J'ai | 199 | |
| (Missing) | 4 | 0.8% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| je | 326 | |
| n'ai | 326 | |
| pas | 326 | |
| j'ai | 199 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 851 | |
| (space) | 652 | |
| J | 525 | |
| ' | 525 | |
| i | 525 | |
| e | 326 | 7.4% |
| n | 326 | 7.4% |
| p | 326 | 7.4% |
| s | 326 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2680 | |
| Space Separator | 652 | 14.9% |
| Uppercase Letter | 525 | 12.0% |
| Other Punctuation | 525 | 12.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 851 | |
| i | 525 | |
| e | 326 | 12.2% |
| n | 326 | 12.2% |
| p | 326 | 12.2% |
| s | 326 | 12.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 652 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 525 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 525 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3205 | |
| Common | 1177 | 26.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 851 | |
| J | 525 | |
| i | 525 | |
| e | 326 | 10.2% |
| n | 326 | 10.2% |
| p | 326 | 10.2% |
| s | 326 | 10.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 652 | |
| ' | 525 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 851 | |
| (space) | 652 | |
| J | 525 | |
| ' | 525 | |
| i | 525 | |
| e | 326 | 7.4% |
| n | 326 | 7.4% |
| p | 326 | 7.4% |
| s | 326 | 7.4% |
nbtik
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 89 |
|---|---|
| Distinct (%) | 45.2% |
| Missing | 332 |
| Missing (%) | 62.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 866.94924 |
| Minimum | 0 |
| Maximum | 64400 |
| Zeros | 5 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.8 |
| Q1 | 15 |
| median | 35 |
| Q3 | 150 |
| 95-th percentile | 1260 |
| Maximum | 64400 |
| Range | 64400 |
| Interquartile range (IQR) | 135 |
Descriptive statistics
| Standard deviation | 5830.7733 |
|---|---|
| Coefficient of variation (CV) | 6.7256225 |
| Kurtosis | 85.585495 |
| Mean | 866.94924 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | 8.9725812 |
| Sum | 170789 |
| Variance | 33997917 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 13 | 2.5% |
| 30 | 12 | 2.3% |
| 10 | 11 | 2.1% |
| 15 | 10 | 1.9% |
| 5 | 9 | 1.7% |
| 200 | 8 | 1.5% |
| 50 | 8 | 1.5% |
| 100 | 7 | 1.3% |
| 40 | 7 | 1.3% |
| 0 | 5 | 0.9% |
| Other values (79) | 107 | 20.2% |
| (Missing) | 332 |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 1 | 0.2% |
| 2 | 4 | 0.8% |
| 3 | 4 | 0.8% |
| 4 | 4 | 0.8% |
| 5 | 9 | |
| 6 | 2 | 0.4% |
| 7 | 1 | 0.2% |
| 8 | 1 | 0.2% |
| 10 | 11 |
| Value | Count | Frequency (%) |
| 64400 | 1 | |
| 39000 | 1 | |
| 33000 | 1 | |
| 5400 | 1 | |
| 2100 | 1 | |
| 2000 | 1 | |
| 1600 | 1 | |
| 1500 | 1 | |
| 1400 | 1 | |
| 1300 | 1 |
instap
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 16 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 8 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3071017 |
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 222 |
| Zeros (%) | 42.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.8136153 |
|---|---|
| Coefficient of variation (CV) | 1.4555389 |
| Kurtosis | 22.238753 |
| Mean | 3.3071017 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 3.4944337 |
| Sum | 1723 |
| Variance | 23.170892 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 222 | |
| 5 | 108 | |
| 10 | 49 | 9.3% |
| 2 | 41 | 7.8% |
| 3 | 36 | 6.8% |
| 1 | 27 | 5.1% |
| 15 | 10 | 1.9% |
| 4 | 9 | 1.7% |
| 7 | 4 | 0.8% |
| 6 | 4 | 0.8% |
| Other values (6) | 11 | 2.1% |
| (Missing) | 8 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 222 | |
| 1 | 27 | 5.1% |
| 2 | 41 | 7.8% |
| 3 | 36 | 6.8% |
| 4 | 9 | 1.7% |
| 5 | 108 | |
| 6 | 4 | 0.8% |
| 7 | 4 | 0.8% |
| 8 | 3 | 0.6% |
| 9 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 50 | 1 | 0.2% |
| 30 | 3 | 0.6% |
| 25 | 1 | 0.2% |
| 20 | 2 | 0.4% |
| 15 | 10 | 1.9% |
| 10 | 49 | |
| 9 | 1 | 0.2% |
| 8 | 3 | 0.6% |
| 7 | 4 | 0.8% |
| 6 | 4 | 0.8% |
snapp
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 13 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.72286822 |
| Minimum | 0 |
| Maximum | 20 |
| Zeros | 421 |
| Zeros (%) | 79.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.2021041 |
|---|---|
| Coefficient of variation (CV) | 3.0463424 |
| Kurtosis | 29.717096 |
| Mean | 0.72286822 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.8115274 |
| Sum | 373 |
| Variance | 4.8492624 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 421 | |
| 2 | 26 | 4.9% |
| 1 | 23 | 4.3% |
| 5 | 21 | 4.0% |
| 3 | 9 | 1.7% |
| 10 | 8 | 1.5% |
| 8 | 2 | 0.4% |
| 20 | 2 | 0.4% |
| 4 | 2 | 0.4% |
| 15 | 1 | 0.2% |
| (Missing) | 13 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 421 | |
| 1 | 23 | 4.3% |
| 2 | 26 | 4.9% |
| 3 | 9 | 1.7% |
| 4 | 2 | 0.4% |
| 5 | 21 | 4.0% |
| 7 | 1 | 0.2% |
| 8 | 2 | 0.4% |
| 10 | 8 | 1.5% |
| 15 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 20 | 2 | 0.4% |
| 15 | 1 | 0.2% |
| 10 | 8 | 1.5% |
| 8 | 2 | 0.4% |
| 7 | 1 | 0.2% |
| 5 | 21 | |
| 4 | 2 | 0.4% |
| 3 | 9 | 1.7% |
| 2 | 26 | |
| 1 | 23 |
googp
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 24 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 10 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8073218 |
| Minimum | 0 |
| Maximum | 150 |
| Zeros | 143 |
| Zeros (%) | 27.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 5 |
| Q3 | 10 |
| 95-th percentile | 25 |
| Maximum | 150 |
| Range | 150 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 12.69563 |
|---|---|
| Coefficient of variation (CV) | 1.6261184 |
| Kurtosis | 43.218799 |
| Mean | 7.8073218 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 5.351079 |
| Sum | 4052 |
| Variance | 161.17902 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 143 | |
| 5 | 106 | |
| 10 | 96 | |
| 15 | 36 | 6.8% |
| 2 | 34 | 6.4% |
| 20 | 20 | 3.8% |
| 1 | 19 | 3.6% |
| 3 | 16 | 3.0% |
| 50 | 9 | 1.7% |
| 30 | 8 | 1.5% |
| Other values (14) | 32 | 6.0% |
| (Missing) | 10 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 143 | |
| 1 | 19 | 3.6% |
| 2 | 34 | 6.4% |
| 3 | 16 | 3.0% |
| 4 | 3 | 0.6% |
| 5 | 106 | |
| 6 | 1 | 0.2% |
| 7 | 6 | 1.1% |
| 8 | 6 | 1.1% |
| 9 | 2 | 0.4% |
| Value | Count | Frequency (%) |
| 150 | 1 | 0.2% |
| 100 | 2 | 0.4% |
| 80 | 1 | 0.2% |
| 50 | 9 | 1.7% |
| 40 | 2 | 0.4% |
| 35 | 1 | 0.2% |
| 30 | 8 | 1.5% |
| 25 | 3 | 0.6% |
| 20 | 20 | |
| 15 | 36 |
googmp
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 20 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 10 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.955684 |
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 190 |
| Zeros (%) | 35.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 7 |
| 95-th percentile | 15 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 8.5254537 |
|---|---|
| Coefficient of variation (CV) | 1.7203384 |
| Kurtosis | 46.365913 |
| Mean | 4.955684 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.4879474 |
| Sum | 2572 |
| Variance | 72.68336 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 190 | |
| 5 | 77 | |
| 10 | 67 | 12.7% |
| 2 | 57 | 10.8% |
| 3 | 27 | 5.1% |
| 1 | 26 | 4.9% |
| 15 | 22 | 4.2% |
| 20 | 14 | 2.6% |
| 8 | 10 | 1.9% |
| 7 | 9 | 1.7% |
| Other values (10) | 20 | 3.8% |
| (Missing) | 10 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 190 | |
| 1 | 26 | 4.9% |
| 2 | 57 | 10.8% |
| 3 | 27 | 5.1% |
| 4 | 6 | 1.1% |
| 5 | 77 | |
| 6 | 2 | 0.4% |
| 7 | 9 | 1.7% |
| 8 | 10 | 1.9% |
| 9 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 100 | 1 | 0.2% |
| 80 | 1 | 0.2% |
| 60 | 1 | 0.2% |
| 50 | 2 | 0.4% |
| 30 | 4 | 0.8% |
| 25 | 1 | 0.2% |
| 20 | 14 | 2.6% |
| 15 | 22 | 4.2% |
| 13 | 1 | 0.2% |
| 10 | 67 |
CS05
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 5 |
| Missing (%) | 0.9% |
| Memory size | 4.3 KiB |
| Assez bien | 291 |
|---|---|
| Très bien | 202 |
| Pas bien | 27 |
| Pas intégré.e du tout | 4 |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.5954198 |
| Min length | 8 |
Characters and Unicode
| Total characters | 5028 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Assez bien |
|---|---|
| 2nd row | Assez bien |
| 3rd row | Très bien |
| 4th row | Assez bien |
| 5th row | Assez bien |
Common Values
| Value | Count | Frequency (%) |
| Assez bien | 291 | |
| Très bien | 202 | |
| Pas bien | 27 | 5.1% |
| Pas intégré.e du tout | 4 | 0.8% |
| (Missing) | 5 | 0.9% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
| bien | 520 | |
| assez | 291 | |
| très | 202 | 19.1% |
| pas | 31 | 2.9% |
| intégré.e | 4 | 0.4% |
| du | 4 | 0.4% |
| tout | 4 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 815 | |
| s | 815 | |
| (space) | 532 | |
| i | 524 | |
| n | 524 | |
| b | 520 | |
| A | 291 | 5.8% |
| z | 291 | 5.8% |
| r | 206 | 4.1% |
| è | 202 | 4.0% |
| Other values (10) | 308 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3968 | |
| Space Separator | 532 | 10.6% |
| Uppercase Letter | 524 | 10.4% |
| Other Punctuation | 4 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 815 | |
| s | 815 | |
| i | 524 | |
| n | 524 | |
| b | 520 | |
| z | 291 | 7.3% |
| r | 206 | 5.2% |
| è | 202 | 5.1% |
| a | 31 | 0.8% |
| t | 12 | 0.3% |
| Other values (5) | 28 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 291 | |
| T | 202 | |
| P | 31 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 532 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4492 | |
| Common | 536 | 10.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 815 | |
| s | 815 | |
| i | 524 | |
| n | 524 | |
| b | 520 | |
| A | 291 | 6.5% |
| z | 291 | 6.5% |
| r | 206 | 4.6% |
| è | 202 | 4.5% |
| T | 202 | 4.5% |
| Other values (8) | 102 | 2.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 532 | |
| . | 4 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4818 | |
| None | 210 | 4.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 815 | 16.9% |
| s | 815 | 16.9% |
| (space) | 532 | 11.0% |
| i | 524 | 10.9% |
| n | 524 | 10.9% |
| b | 520 | 10.8% |
| A | 291 | 6.0% |
| z | 291 | 6.0% |
| r | 206 | 4.3% |
| T | 202 | 4.2% |
| Other values (8) | 98 | 2.0% |
None
| Value | Count | Frequency (%) |
| è | 202 | 96.2% |
| é | 8 | 3.8% |
CS12
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 5 |
| Missing (%) | 0.9% |
| Memory size | 4.3 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.9007634 |
| Min length | 6 |
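The length statistics above can be cross-checked against the category counts. The following is a minimal sketch in plain Python, assuming the three counts reported for CS12 (301, 171, 52) and leaving out the 5 missing cells; the variable names are illustrative and not part of the report.

```python
from statistics import median

# Category counts for CS12 as reported in its Common Values table
# (missing cells excluded).
counts = {"Parfois": 301, "Souvent": 171, "Jamais": 52}

# One string length per non-missing observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

total_characters = sum(lengths)
mean_length = total_characters / len(lengths)

print(total_characters)       # 3616
print(round(mean_length, 7))  # 6.9007634
print(min(lengths), median(lengths), max(lengths))
```

The total matches the "Total characters" figure reported under Characters and Unicode for this variable.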
Characters and Unicode
| Total characters | 3616 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Souvent |
|---|---|
| 2nd row | Souvent |
| 3rd row | Parfois |
| 4th row | Souvent |
| 5th row | Parfois |
Common Values
| Value | Count | Frequency (%) |
| Parfois | 301 | 56.9% |
| Souvent | 171 | 32.3% |
| Jamais | 52 | 9.8% |
| (Missing) | 5 | 0.9% |
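The Frequency (%) column appears to be each count divided by all 529 observations, with missing cells included in the denominator. A short sketch under that assumption (the dictionary below simply restates the table):

```python
# Counts for CS12, including the missing cells.
N_OBSERVATIONS = 529
counts = {"Parfois": 301, "Souvent": 171, "Jamais": 52, "(Missing)": 5}

# Share of all observations, formatted to one decimal place.
frequencies = {value: f"{n / N_OBSERVATIONS:.1%}" for value, n in counts.items()}

for value, share in frequencies.items():
    print(f"{value}: {share}")
```

The shares for "Jamais" and "(Missing)" come out as 9.8% and 0.9%, which agrees with the two percentages that survive in the table.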
Words
| Value | Count | Frequency (%) |
| parfois | 301 | 57.4% |
| souvent | 171 | 32.6% |
| jamais | 52 | 9.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 472 | 13.1% |
| a | 405 | 11.2% |
| i | 353 | 9.8% |
| s | 353 | 9.8% |
| P | 301 | 8.3% |
| r | 301 | 8.3% |
| f | 301 | 8.3% |
| S | 171 | 4.7% |
| u | 171 | 4.7% |
| v | 171 | 4.7% |
| Other values (5) | 617 | 17.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3092 | 85.5% |
| Uppercase Letter | 524 | 14.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 472 | 15.3% |
| a | 405 | 13.1% |
| i | 353 | 11.4% |
| s | 353 | 11.4% |
| r | 301 | 9.7% |
| f | 301 | 9.7% |
| u | 171 | 5.5% |
| v | 171 | 5.5% |
| e | 171 | 5.5% |
| n | 171 | 5.5% |
| Other values (2) | 223 | 7.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 301 | 57.4% |
| S | 171 | 32.6% |
| J | 52 | 9.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3616 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 472 | 13.1% |
| a | 405 | 11.2% |
| i | 353 | 9.8% |
| s | 353 | 9.8% |
| P | 301 | 8.3% |
| r | 301 | 8.3% |
| f | 301 | 8.3% |
| S | 171 | 4.7% |
| u | 171 | 4.7% |
| v | 171 | 4.7% |
| Other values (5) | 617 | 17.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3616 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 472 | 13.1% |
| a | 405 | 11.2% |
| i | 353 | 9.8% |
| s | 353 | 9.8% |
| P | 301 | 8.3% |
| r | 301 | 8.3% |
| f | 301 | 8.3% |
| S | 171 | 4.7% |
| u | 171 | 4.7% |
| v | 171 | 4.7% |
| Other values (5) | 617 | 17.1% |
resid2
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 3 |
| Missing (%) | 0.6% |
| Memory size | 4.3 KiB |
Length
| Max length | 28 |
|---|---|
| Median length | 13 |
| Mean length | 17.676806 |
| Min length | 8 |
Characters and Unicode
| Total characters | 9298 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bordeaux |
|---|---|
| 2nd row | Bordeaux |
| 3rd row | Bordeaux |
| 4th row | Pessac / Talence / Gradignan |
| 5th row | Autre commune |
Common Values
| Value | Count | Frequency (%) |
| Bordeaux | 249 | 47.1% |
| Pessac / Talence / Gradignan | 247 | 46.7% |
| Autre commune | 30 | 5.7% |
| (Missing) | 3 | 0.6% |
Words
| Value | Count | Frequency (%) |
| | 494 | 32.0% |
| bordeaux | 249 | 16.1% |
| pessac | 247 | 16.0% |
| talence | 247 | 16.0% |
| gradignan | 247 | 16.0% |
| autre | 30 | 1.9% |
| commune | 30 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1237 | 13.3% |
| e | 1050 | 11.3% |
| (space) | 1018 | 10.9% |
| n | 771 | 8.3% |
| r | 526 | 5.7% |
| c | 524 | 5.6% |
| d | 496 | 5.3% |
| / | 494 | 5.3% |
| s | 494 | 5.3% |
| u | 309 | 3.3% |
| Other values (12) | 2379 | 25.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6766 | 72.8% |
| Uppercase Letter | 1020 | 11.0% |
| Space Separator | 1018 | 10.9% |
| Other Punctuation | 494 | 5.3% |
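The category names in this table correspond to Unicode general categories. As an illustration only (the report does not state how it computes them), Python's standard `unicodedata` module yields the same breakdown for a single value; the `NAMES` mapping is a hypothetical helper covering just the codes seen here.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this value.
NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

value = "Pessac / Talence / Gradignan"

# Count characters per Unicode general category.
categories = Counter(NAMES[unicodedata.category(ch)] for ch in value)

for name, n in categories.most_common():
    print(f"{name}: {n}")
```

Multiplying the per-value counts (4 spaces, 2 slashes) by the 247 occurrences of this value gives 988 spaces and 494 slashes; the remaining 30 spaces in the table come from "Autre commune".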
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1237 | 18.3% |
| e | 1050 | 15.5% |
| n | 771 | 11.4% |
| r | 526 | 7.8% |
| c | 524 | 7.7% |
| d | 496 | 7.3% |
| s | 494 | 7.3% |
| u | 309 | 4.6% |
| o | 279 | 4.1% |
| x | 249 | 3.7% |
| Other values (5) | 831 | 12.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 249 | 24.4% |
| P | 247 | 24.2% |
| T | 247 | 24.2% |
| G | 247 | 24.2% |
| A | 30 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1018 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 494 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7786 | 83.7% |
| Common | 1512 | 16.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1237 | 15.9% |
| e | 1050 | 13.5% |
| n | 771 | 9.9% |
| r | 526 | 6.8% |
| c | 524 | 6.7% |
| d | 496 | 6.4% |
| s | 494 | 6.3% |
| u | 309 | 4.0% |
| o | 279 | 3.6% |
| B | 249 | 3.2% |
| Other values (10) | 1851 | 23.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 1018 | 67.3% |
| / | 494 | 32.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9298 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1237 | 13.3% |
| e | 1050 | 11.3% |
| (space) | 1018 | 10.9% |
| n | 771 | 8.3% |
| r | 526 | 5.7% |
| c | 524 | 5.6% |
| d | 496 | 5.3% |
| / | 494 | 5.3% |
| s | 494 | 5.3% |
| u | 309 | 3.3% |
| Other values (12) | 2379 | 25.6% |
resid6
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 5 |
| Missing (%) | 0.9% |
| Memory size | 4.3 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 20.183206 |
| Min length | 14 |
Characters and Unicode
| Total characters | 10576 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | - de 20 000 habitants |
|---|---|
| 2nd row | Agglomération parisienne |
| 3rd row | Commune rurale |
| 4th row | 100 000 habitats et plus |
| 5th row | - de 20 000 habitants |
Common Values
| Value | Count | Frequency (%) |
| Commune rurale | 177 | 33.5% |
| - de 20 000 habitants | 107 | 20.2% |
| 20 000 Ã 99 999 habitants | 91 | 17.2% |
| 100 000 habitats et plus | 73 | 13.8% |
| Autre pays que la France | 43 | 8.1% |
| Agglomération parisienne | 33 | 6.2% |
| (Missing) | 5 | 0.9% |
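The label `20 000 Ã 99 999 habitants` looks like a mis-decoded `à` (UTF-8 bytes read as Latin-1), which would also explain why `Ã` is counted outside the ASCII block for this variable. A hedged sketch of relabelling the category before analysis follows; the `LABEL_FIXES` mapping and the `clean_label` helper are assumptions for illustration and are not produced by the report.

```python
# Hypothetical mapping from labels as stored to their presumed intended form.
LABEL_FIXES = {
    "20 000 Ã 99 999 habitants": "20 000 à 99 999 habitants",
}


def clean_label(label: str) -> str:
    """Return the repaired label, or the label unchanged if no fix is known."""
    return LABEL_FIXES.get(label, label)


print(clean_label("20 000 Ã 99 999 habitants"))
print(clean_label("Commune rurale"))
```

An explicit mapping is used rather than a blanket re-decode because correctly encoded characters sit alongside the damaged one (for example the `é` in "Agglomération parisienne").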
Words
| Value | Count | Frequency (%) |
| 000 | 271 | 13.0% |
| 20 | 198 | 9.5% |
| habitants | 198 | 9.5% |
| commune | 177 | 8.5% |
| rurale | 177 | 8.5% |
| | 107 | 5.1% |
| de | 107 | 5.1% |
| Ã | 91 | 4.4% |
| 99 | 91 | 4.4% |
| 999 | 91 | 4.4% |
| Other values (11) | 573 | 27.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1557 | 14.7% |
| 0 | 1157 | 10.9% |
| a | 914 | 8.6% |
| e | 729 | 6.9% |
| t | 691 | 6.5% |
| n | 517 | 4.9% |
| u | 513 | 4.9% |
| r | 506 | 4.8% |
| 9 | 455 | 4.3% |
| s | 420 | 4.0% |
| Other values (20) | 3117 | 29.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6733 | 63.7% |
| Decimal Number | 1883 | 17.8% |
| Space Separator | 1557 | 14.7% |
| Uppercase Letter | 296 | 2.8% |
| Dash Punctuation | 107 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 914 | 13.6% |
| e | 729 | 10.8% |
| t | 691 | 10.3% |
| n | 517 | 7.7% |
| u | 513 | 7.6% |
| r | 506 | 7.5% |
| s | 420 | 6.2% |
| m | 387 | 5.7% |
| i | 370 | 5.5% |
| l | 326 | 4.8% |
| Other values (11) | 1360 | 20.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1157 | 61.4% |
| 9 | 455 | 24.2% |
| 2 | 198 | 10.5% |
| 1 | 73 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 177 | 59.8% |
| A | 76 | 25.7% |
| F | 43 | 14.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1557 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 107 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7029 | 66.5% |
| Common | 3547 | 33.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 914 | 13.0% |
| e | 729 | 10.4% |
| t | 691 | 9.8% |
| n | 517 | 7.4% |
| u | 513 | 7.3% |
| r | 506 | 7.2% |
| s | 420 | 6.0% |
| m | 387 | 5.5% |
| i | 370 | 5.3% |
| l | 326 | 4.6% |
| Other values (14) | 1656 | 23.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 1557 | 43.9% |
| 0 | 1157 | 32.6% |
| 9 | 455 | 12.8% |
| 2 | 198 | 5.6% |
| - | 107 | 3.0% |
| 1 | 73 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10452 | 98.8% |
| None | 124 | 1.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1557 | 14.9% |
| 0 | 1157 | 11.1% |
| a | 914 | 8.7% |
| e | 729 | 7.0% |
| t | 691 | 6.6% |
| n | 517 | 4.9% |
| u | 513 | 4.9% |
| r | 506 | 4.8% |
| 9 | 455 | 4.4% |
| s | 420 | 4.0% |
| Other values (18) | 2993 | 28.6% |
None
| Value | Count | Frequency (%) |
| Ã | 91 | 73.4% |
| é | 33 | 26.6% |
Correlations
| | random_id4 | nbinsta | nbtwit | nbsnap | nbtik | instap | snapp | googp | googmp | survey | tps_rs | instagra | twitter | snapchat | tiktok | CS05 | CS12 | resid2 | resid6 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| random_id4 | 1.000 | 0.001 | 0.045 | 0.087 | 0.001 | -0.016 | -0.006 | -0.008 | -0.005 | 0.000 | 0.070 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.077 | 0.000 | 0.000 |
| nbinsta | 0.001 | 1.000 | 0.279 | 0.302 | 0.534 | 0.182 | 0.142 | 0.076 | -0.002 | 0.154 | 0.040 | 1.000 | 0.134 | 0.170 | 0.223 | 0.000 | 0.000 | 0.173 | 0.080 |
| nbtwit | 0.045 | 0.279 | 1.000 | 0.134 | 0.271 | -0.167 | 0.034 | -0.141 | -0.161 | 0.000 | 0.020 | 0.000 | 1.000 | 0.000 | 0.000 | 0.148 | 0.082 | 0.000 | 0.114 |
| nbsnap | 0.087 | 0.302 | 0.134 | 1.000 | 0.273 | -0.106 | 0.034 | -0.043 | -0.092 | 0.083 | 0.000 | 0.160 | 0.000 | 1.000 | 0.000 | 0.000 | 0.048 | 0.000 | 0.000 |
| nbtik | 0.001 | 0.534 | 0.271 | 0.273 | 1.000 | 0.129 | 0.127 | -0.021 | -0.116 | 0.062 | 0.000 | 0.000 | 0.000 | 0.035 | 1.000 | 0.000 | 0.000 | 0.154 | 0.000 |
| instap | -0.016 | 0.182 | -0.167 | -0.106 | 0.129 | 1.000 | 0.313 | 0.285 | 0.286 | 0.000 | 0.075 | 0.000 | 0.000 | 0.000 | 0.148 | 0.055 | 0.066 | 0.111 | 0.000 |
| snapp | -0.006 | 0.142 | 0.034 | 0.034 | 0.127 | 0.313 | 1.000 | 0.193 | 0.167 | 0.000 | 0.000 | 0.000 | 0.000 | 0.174 | 0.168 | 0.000 | 0.124 | 0.068 | 0.063 |
| googp | -0.008 | 0.076 | -0.141 | -0.043 | -0.021 | 0.285 | 0.193 | 1.000 | 0.582 | 0.050 | 0.000 | 0.000 | 0.064 | 0.054 | 0.046 | 0.053 | 0.000 | 0.076 | 0.000 |
| googmp | -0.005 | -0.002 | -0.161 | -0.092 | -0.116 | 0.286 | 0.167 | 0.582 | 1.000 | 0.000 | 0.000 | 0.000 | 0.015 | 0.000 | 0.037 | 0.000 | 0.000 | 0.100 | 0.014 |
| survey | 0.000 | 0.154 | 0.000 | 0.083 | 0.062 | 0.000 | 0.000 | 0.050 | 0.000 | 1.000 | 0.000 | 0.045 | 0.062 | 0.086 | 0.059 | 0.019 | 0.077 | 0.308 | 0.000 |
| tps_rs | 0.070 | 0.040 | 0.020 | 0.000 | 0.000 | 0.075 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.280 | 0.185 | 0.147 | 0.319 | 0.077 | 0.093 | 0.075 | 0.020 |
| instagra | 0.000 | 1.000 | 0.000 | 0.160 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.045 | 0.280 | 1.000 | 0.074 | 0.126 | 0.101 | 0.088 | 0.182 | 0.065 | 0.071 |
| twitter | 0.000 | 0.134 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.064 | 0.015 | 0.062 | 0.185 | 0.074 | 1.000 | 0.144 | 0.187 | 0.053 | 0.000 | 0.000 | 0.084 |
| snapchat | 0.000 | 0.170 | 0.000 | 1.000 | 0.035 | 0.000 | 0.174 | 0.054 | 0.000 | 0.086 | 0.147 | 0.126 | 0.144 | 1.000 | 0.259 | 0.073 | 0.000 | 0.000 | 0.215 |
| tiktok | 0.000 | 0.223 | 0.000 | 0.000 | 1.000 | 0.148 | 0.168 | 0.046 | 0.037 | 0.059 | 0.319 | 0.101 | 0.187 | 0.259 | 1.000 | 0.000 | 0.070 | 0.000 | 0.000 |
| CS05 | 0.000 | 0.000 | 0.148 | 0.000 | 0.000 | 0.055 | 0.000 | 0.053 | 0.000 | 0.019 | 0.077 | 0.088 | 0.053 | 0.073 | 0.000 | 1.000 | 0.261 | 0.112 | 0.095 |
| CS12 | 0.077 | 0.000 | 0.082 | 0.048 | 0.000 | 0.066 | 0.124 | 0.000 | 0.000 | 0.077 | 0.093 | 0.182 | 0.000 | 0.000 | 0.070 | 0.261 | 1.000 | 0.000 | 0.038 |
| resid2 | 0.000 | 0.173 | 0.000 | 0.000 | 0.154 | 0.111 | 0.068 | 0.076 | 0.100 | 0.308 | 0.075 | 0.065 | 0.000 | 0.000 | 0.000 | 0.112 | 0.000 | 1.000 | 0.176 |
| resid6 | 0.000 | 0.080 | 0.114 | 0.000 | 0.000 | 0.000 | 0.063 | 0.000 | 0.014 | 0.000 | 0.020 | 0.071 | 0.084 | 0.215 | 0.000 | 0.095 | 0.038 | 0.176 | 1.000 |
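The high-correlation alerts in the overview can be read off this matrix. The sketch below assumes a cutoff of 0.5 on the absolute coefficient (the report does not state its cutoff, so this value is an assumption) and uses only a handful of coefficients copied from the table.

```python
# A few coefficients copied from the matrix above (not the full table).
coefficients = {
    ("nbinsta", "nbtik"): 0.534,
    ("nbinsta", "instagra"): 1.000,
    ("googp", "googmp"): 0.582,
    ("instap", "snapp"): 0.313,
    ("CS05", "CS12"): 0.261,
}

THRESHOLD = 0.5  # assumed cutoff, not taken from the report

# Keep the pairs whose absolute coefficient reaches the cutoff.
high_pairs = sorted(
    pair for pair, r in coefficients.items() if abs(r) >= THRESHOLD
)

for left, right in high_pairs:
    print(f"{left} ~ {right}: {coefficients[(left, right)]:.3f}")
```

Under this cutoff the three flagged pairs are ones that also appear in the overview alerts, while instap/snapp (0.313) and CS05/CS12 (0.261) stay below it.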
First rows
| | random_id4 | survey | tps_rs | instagra | nbinsta | twitter | nbtwit | snapchat | nbsnap | tiktok | nbtik | instap | snapp | googp | googmp | CS05 | CS12 | resid2 | resid6 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 37320 | ScPoBx_3A | 2.0 | J'ai | 188.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | Je n'ai pas | NaN | 5.0 | 5.0 | 0.0 | 0.0 | Assez bien | Souvent | Bordeaux | - de 20 000 habitants |
| 1 | 75227 | ScPoBx_3A | 3.0 | J'ai | 400.0 | Je n'ai pas | NaN | J'ai | 40.0 | J'ai | 100.0 | 0.0 | 0.0 | 0.0 | 0.0 | Assez bien | Souvent | Bordeaux | Agglomération parisienne |
| 2 | 78695 | ScPoBx_3A | 2.0 | J'ai | 353.0 | J'ai | 12.0 | Je n'ai pas | NaN | J'ai | 271.0 | 3.0 | 0.0 | 10.0 | 3.0 | Très bien | Parfois | Bordeaux | Commune rurale |
| 3 | 80831 | ScPoBx_1A | 2.0 | J'ai | 450.0 | Je n'ai pas | NaN | J'ai | 4.0 | J'ai | 11.0 | 5.0 | 0.0 | 13.0 | 13.0 | Assez bien | Souvent | Pessac / Talence / Gradignan | 100 000 habitats et plus |
| 4 | 140145 | ScPoBx_1A | 4.0 | J'ai | 120.0 | Je n'ai pas | NaN | J'ai | 10.0 | J'ai | 10.0 | 0.0 | 0.0 | 7.0 | 15.0 | Assez bien | Parfois | Autre commune | - de 20 000 habitants |
| 5 | 142591 | ScPoBx_1A | 4.0 | J'ai | 1400.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | J'ai | 230.0 | NaN | NaN | NaN | NaN | Assez bien | Souvent | Pessac / Talence / Gradignan | Agglomération parisienne |
| 6 | 154015 | ScPoBx_3A | 3.0 | J'ai | 450.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | Je n'ai pas | NaN | 3.0 | 0.0 | 10.0 | 1.0 | Assez bien | Parfois | Autre commune | 20 000 Ã 99 999 habitants |
| 7 | 157492 | ScPoBx_3A | 4.0 | J'ai | 170.0 | J'ai | 3.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | 0.0 | 0.0 | 10.0 | 10.0 | Assez bien | Parfois | Pessac / Talence / Gradignan | Commune rurale |
| 8 | 180228 | ScPoBx_1A | 1.0 | J'ai | 200.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | J'ai | 15.0 | 5.0 | 0.0 | 15.0 | 20.0 | Assez bien | Parfois | Bordeaux | Commune rurale |
| 9 | 215268 | ScPoBx_1A | 2.0 | J'ai | 250.0 | Je n'ai pas | NaN | J'ai | 20.0 | Je n'ai pas | NaN | 2.0 | 1.0 | 5.0 | 2.0 | Assez bien | Parfois | Pessac / Talence / Gradignan | Agglomération parisienne |
Last rows
| | random_id4 | survey | tps_rs | instagra | nbinsta | twitter | nbtwit | snapchat | nbsnap | tiktok | nbtik | instap | snapp | googp | googmp | CS05 | CS12 | resid2 | resid6 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 519 | 9741187 | ScPoBx_3A | 1.0 | J'ai | 130.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | Je n'ai pas | NaN | 3.0 | NaN | NaN | NaN | Assez bien | Souvent | Pessac / Talence / Gradignan | Autre pays que la France |
| 520 | 9745640 | ScPoBx_1A | 2.0 | J'ai | 500.0 | Je n'ai pas | NaN | J'ai | 200.0 | Je n'ai pas | NaN | 0.0 | 0.0 | 10.0 | 0.0 | Assez bien | Parfois | Pessac / Talence / Gradignan | - de 20 000 habitants |
| 521 | 9801565 | ScPoBx_1A | 3.0 | J'ai | 433.0 | Je n'ai pas | NaN | J'ai | NaN | J'ai | 22.0 | 5.0 | 5.0 | 5.0 | 0.0 | Assez bien | Parfois | Pessac / Talence / Gradignan | 20 000 Ã 99 999 habitants |
| 522 | 9816676 | ScPoBx_3A | 1.0 | J'ai | 166.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | Je n'ai pas | NaN | 0.0 | 0.0 | 5.0 | 2.0 | Très bien | Parfois | Bordeaux | Agglomération parisienne |
| 523 | 9897608 | ScPoBx_1A | 2.0 | J'ai | 180.0 | J'ai | 4.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | 5.0 | 0.0 | 0.0 | 2.0 | Assez bien | Parfois | Bordeaux | 100 000 habitats et plus |
| 524 | 9928746 | ScPoBx_1A | 3.0 | J'ai | 100.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | J'ai | 4.0 | 0.0 | 0.0 | 0.0 | 0.0 | Très bien | Parfois | Pessac / Talence / Gradignan | Agglomération parisienne |
| 525 | 9940978 | ScPoBx_1A | 1.0 | J'ai | 820.0 | J'ai | 7.0 | Je n'ai pas | NaN | J'ai | 398.0 | 0.0 | 0.0 | 5.0 | 10.0 | Très bien | Parfois | Pessac / Talence / Gradignan | - de 20 000 habitants |
| 526 | 9947814 | ScPoBx_3A | 1.0 | J'ai | 717.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | J'ai | 1163.0 | 10.0 | 0.0 | 10.0 | 0.0 | Très bien | Parfois | Bordeaux | Autre pays que la France |
| 527 | 9956964 | ScPoBx_1A | 1.0 | Je n'ai pas | NaN | Je n'ai pas | NaN | Je n'ai pas | NaN | Je n'ai pas | NaN | 0.0 | 0.0 | 0.0 | 0.0 | Assez bien | Parfois | Autre commune | - de 20 000 habitants |
| 528 | 9973576 | ScPoBx_3A | 2.0 | J'ai | 800.0 | J'ai | 150.0 | J'ai | 100.0 | Je n'ai pas | NaN | 1.0 | 1.0 | 0.0 | 0.0 | Très bien | Parfois | Bordeaux | Commune rurale |